Image capture device with a spherical capture mode and a non-spherical capture mode

ABSTRACT

An image capture device may switch operation between a spherical capture mode or a non-spherical capture mode. Operation of the image capture device in the spherical capture mode includes generation of spherical visual content based on the visual content generated by multiple image sensors. Operation of the image capture device in the non-spherical capture mode includes generation of non-spherical visual content based on visual content generated by a single image sensor.

FIELD

This disclosure relates to an image capture device with a sphericalcapture mode and a non-spherical capture mode.

BACKGROUND

A spherical camera may include multiple image sensors to capture360-degree scene around the camera. Images captured by different imagesensors may be stitched together to generate a spherical video. A userusing a spherical camera may not wish to capture a spherical video.

SUMMARY

This disclosure relates to an image capture device with a sphericalcapture mode and a non-spherical capture mode. An image capture devicemay include a housing. The housing may carry one or more of a firstimage sensor, a second image sensor, a first optical element, a secondoptical element, a processor, an electronic storage, and/or othercomponents. The first image sensor may generate a first output signalconveying first visual information defining first visual content basedon light guided thereto by the first optical element, and the secondimage sensor may generate a second output signal conveying second visualinformation defining second visual content based on light guided theretoby the second optical element. The processor(s) may be configured tooperate the image capture device in a spherical capture mode or anon-spherical capture mode. Operation of the image capture device in thespherical capture mode may include generation of spherical visualcontent based on the first visual content and the second visual content.Operation of the image capture device in the non-spherical capture modemay include generation of non-spherical visual content based on thefirst visual content or the second visual content.

An electronic storage may store visual information defining visualcontent, information relating to visual content, spherical visualinformation defining spherical visual content, information relating tospherical visual content, non-spherical visual information definingnon-spherical visual content, information relating to non-sphericalvisual content, information relating to operation of the image capturedevice, information relating to the spherical capture mode, informationrelating to the non-spherical capture mode, and/or other information.Visual content may refer to content of one or more images and/or one ormore videos that may be consumed visually. For example, visual contentmay be defined within one or more images and/or one or more video framesof a video. Visual content may be stored in one or more formats and/orone or more containers.

The housing may carry one or more components of the image capturedevice. The housing may carry (be attached to, support, hold, and/orotherwise carry) one or more of a first image sensor, a second imagesensor, a first optical element, a second optical element, a processor,an electronic storage, and/or other components.

The first image sensor may be configured to generate a first outputsignal and/or other output signals. The first output signal may conveyfirst visual information based on light that becomes incident thereonand/or other information. The first visual information may define firstvisual content. The second image sensor may be configured to generate asecond output signal and/or other output signals. The second outputsignal may convey second visual information based on light that becomesincident thereon and/or other information. The second visual informationmay define second visual content.

The first optical element may be configured to guide light within afirst field of view to the first image sensor. The first field of viewmay be greater than 180 degrees. The second optical element may beconfigured to guide light within a second field of view to the secondimage sensor. The second field of view may be greater than 180 degrees.

The second optical element may be carried by the housing such that aperipheral portion of the first field of view and a peripheral portionof the second field of view overlap. The overlap of the peripheralportion of the first field of view and the peripheral portion of thesecond field of view may enable spherical capture of visual contentbased on the first visual content and the second visual content.

The processor(s) may be configured by machine-readable instructions.Executing the machine-readable instructions may cause the processor(s)to facilitate operation of the image capture device. Themachine-readable instructions may include one or more computer programcomponents. The computer program components may include one or more of aspherical mode component, a non-spherical mode component, a switchcomponent, and/or other computer program components.

The spherical mode component may be configured to operate the imagecapture device in the spherical capture mode. An operation of the imagecapture device in the spherical capture mode may include generation ofspherical visual content based on the first visual content, the secondvisual content, and/or other visual content.

The non-spherical mode component may be configured to operate the imagecapture device in the non-spherical capture mode. An operation of theimage capture device in the non-spherical capture mode may includegeneration of non-spherical visual content based on the first visualcontent and/or other visual content.

The switch component may be configured to switch the operation of theimage capture device between the spherical capture mode and thenon-spherical capture mode.

In some implementations, the operation of the image capture device inthe non-spherical capture mode may further include inclusion of anextent of the first visual content within the non-spherical visualcontent. The extent of the first visual content may be determined basedon a gaze of a user within the second visual content.

In some implementations, the operation of the image capture device inthe non-spherical capture mode may further include change in a focus ofthe image capture device for the generation of non-spherical visualcontent. The focus of the image capture device may be changed based on agaze of a user within the second visual content.

In some implementations, the operation of the image capture device inthe non-spherical capture mode may further include combination of aportion of the first visual content and a portion of the second visualcontent within the non-spherical visual content. The portion of thefirst visual content and the portion of the second visual content may betime-synchronized based on operation of the first image sensor and thesecond image sensor by the image capture device.

In some implementations, the portion of the first visual content and theportion of the second visual content may be spatially arranged withinthe non-spherical visual content for simultaneous presentation.

In some implementations, the portion of the first visual content and theportion of the second visual content may be temporally arranged withinthe non-spherical visual content for sequential presentation.

In some implementations, the portion of the first visual content and theportion of the second visual content may be temporally arranged forsequential presentation based on audio associated with the second visualcontent. The portion of the first visual content and the portion of thesecond visual content may be temporally arranged for sequentialpresentation based on identification of first depiction of interestwithin the portion of the first visual content and second depiction ofinterest within the portion of the second visual content.

In some implementations, the sequential presentation of the portion ofthe first visual content and the portion of the second visual contentmay include a panning transition from the first depiction of interestwithin the first visual content to the second depiction of interestwithin the second visual content. The panning transition from the firstdepiction of interest within the first visual content to the seconddepiction of interest within the second visual content may includestitching of the first visual content and the second visual content.

These and other objects, features, and characteristics of the systemand/or method disclosed herein, as well as the methods of operation andfunctions of the related elements of structure and the combination ofparts and economies of manufacture, will become more apparent uponconsideration of the following description and the appended claims withreference to the accompanying drawings, all of which form a part of thisspecification, wherein like reference numerals designate correspondingparts in the various figures. It is to be expressly understood, however,that the drawings are for the purpose of illustration and descriptiononly and are not intended as a definition of the limits of theinvention. As used in the specification and in the claims, the singularform of “a,” “an,” and “the” include plural referents unless the contextclearly dictates otherwise.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a system that operates an image capture device with aspherical capture mode and a non-spherical capture mode.

FIG. 2 illustrates a method for operating an image capture device with aspherical capture mode and a non-spherical capture mode.

FIG. 3 illustrates an example spherical visual content.

FIG. 4 illustrates example viewing directions for spherical videocontent.

FIGS. 5A-5B illustrate example extents of spherical visual content.

FIG. 6 illustrates an example image capture device.

FIG. 7A illustrates an example generation of spherical visual content.

FIG. 7B illustrates an example generation of non-spherical visualcontent.

FIG. 8A illustrates examples of gaze detection and capture extent.

FIG. 8B illustrates examples of gaze detection, capture extent, andfocus point.

FIG. 8C illustrates examples of capture extents.

FIG. 9A illustrates an example spatial arrangement of visual contentpresentation.

FIG. 9B illustrates an example spatial arrangement of visual contentpresentation.

FIG. 9C illustrates an example timing arrangement of visual contentpresentation.

FIG. 9D illustrates an example timing arrangement of visual contentpresentation.

FIG. 9E illustrates an example panning between depictions of interest.

DETAILED DESCRIPTION

FIG. 1 illustrates a system 10 for operating an image capture devicewith a spherical capture mode and a non-spherical capture mode. Thesystem 10 may include or be part of an image capture device. The system10 may include one or more of a processor 11, an interface 12 (e.g.,bus, wireless interface), an electronic storage 13, an optical element A14B, an optical element B 14B, an image sensor A 15A, and image sensor B15B, and/or other components. One or more components of the system 10may be carried by a housing of the image capture device. The imagesensor A 15A may generate a first output signal conveying first visualinformation defining first visual content based on light guided theretoby the optical element A 14A, and the image sensor B 15B may generate asecond output signal conveying second visual information defining secondvisual content based on light guided thereto by the optical element B14B. The processor 11 may be configured to operate the image capturedevice in a spherical capture mode or a non-spherical capture mode.Operation of the image capture device in the spherical capture mode mayinclude generation of spherical visual content based on the first visualcontent and the second visual content. Operation of the image capturedevice in the non-spherical capture mode may include generation ofnon-spherical visual content based on the first visual content or thesecond visual content.

The electronic storage 13 may be configured to include electronicstorage medium that electronically stores information. The electronicstorage 13 may store software algorithms, information determined by theprocessor 11, information received remotely, and/or other informationthat enables the system 10 to function properly. For example, theelectronic storage 13 may store visual information defining visualcontent, information relating to visual content, spherical visualinformation defining spherical visual content, information relating tospherical visual content, non-spherical visual information definingnon-spherical visual content, information relating to non-sphericalvisual content, information relating to operation of the image capturedevice, information relating to the spherical capture mode, informationrelating to the non-spherical capture mode, and/or other information.

Visual content may refer to content of one or more images and/or one ormore videos that may be consumed visually. For example, visual contentmay be defined within one or more images and/or one or more video framesof a video. For instance, video frame(s) of a video may define thevisual content of the video. Video frame(s) may define visual contentviewable as a function of progress through a progress length of thevideo. A video frame may include an image of the video at a momentwithin the progress length of the video. Visual content may be generatedbased on light received within a field of view of a single image sensoror within fields of view of multiple image sensors during a captureperiod.

Visual content may be stored in one or more formats and/or one or morecontainers. A format may refer to one or more ways in which theinformation defining visual content is arranged/laid out (e.g., fileformat). A container may refer to one or more ways in which informationdefining visual content is arranged/laid out in association with otherinformation (e.g., wrapper format). Information defining visual content(visual information) may be stored within a single file or multiplefiles. For example, visual information defining an image or video framesof a video may be stored within a single file (e.g., image file, videofile), multiple files (e.g., multiple image files, multiple videofiles), a combination of different files, and/or other files.

Visual content may include spherical visual content. Spherical visualcontent may include visual content viewable from one or more points ofview (e.g., within a sphere, center of a sphere). Spherical visualcontent may refer to visual content generated through capture ofmultiple views from a single location. Spherical visual content may becaptured through the use of one or more image capture devices to captureimages from a location. The captured images may be stitched together toform the spherical visual content (spherical image). Spherical visualcontent may include full spherical visual content (360 degrees ofcapture) or partial spherical visual content (less than 360 degrees ofcapture). Partial spherical visual content may be referred to aspanoramic visual content.

Content of one or more videos may be referred to as video content. Videocontent may have a progress length. That is, a video may include videocontent having a progress length. A progress length may be defined interms of time durations and/or frame numbers. For example, video contentof a video may have a time duration of 60 seconds. Video content of avideo may have 1800 video frames. Video content having 1800 video framesmay have a play time duration of 60 seconds when viewed at 30 frames persecond. Other progress lengths, time durations, and frame numbers arecontemplated.

Video content may include visual content, audio content, and/or othercontent. For example, video content may include visual content viewableas a function of progress through the progress length of the videocontent, audio content playable as a function of progress through theprogress length of the video content, and/or other content that may beplayed back as a function of progress through the progress length of thevideo content.

Audio content may include audio/sound captured (e.g., by soundsensor(s), microphone(s)) with the capture of the visual content and/oraudio/sound provided as an accompaniment for the visual content. Audiocontent may include one or more of voices, activities, songs, music,and/or other audio/sounds. For example, audio content may include soundscaptured by a single sound sensor or an array of sound sensors. Thesound sensor(s) may receive and convert sounds into output signals. Theoutput signals may convey sound information and/or other information.The sound information may define audio content in one or more formats,such as WAV, MP3, MP4, RAW. Audio content may include audio/soundgenerated by one or more computing devices, such as procedural audio.Audio content may be synchronized with the visual content. For example,audio content may include music, song, and/or soundtrack, and the visualcontent of the video content may be synchronized with music, song,and/or soundtrack.

Video content may include one or more of spherical video content,virtual reality content, and/or other video content. Spherical videocontent and/or virtual reality content may include visual contentviewable from one or more points of view as a function of progressthrough the progress length of the spherical/virtual reality videocontent.

Spherical video content may refer to video content generated throughcapture of multiple views from a single location. Spherical videocontent may be captured through the use of one or more image capturedevices to capture images/videos from a location. The capturedimages/videos may be stitched together to form the spherical videocontent. Spherical video content may include full spherical videocontent (360 degrees of capture) or partial spherical video content(less than 360 degrees of capture). Partial spherical video content maybe referred to as panoramic video content.

Visual content of spherical video content may be included within one ormore spherical video frames of the spherical video. The spherical videoframe(s) may define the visual content of the video content. That is,spherical video content may include spherical video frame(s) that definethe visual content of the spherical video content. Spherical videoframe(s) may define visual content viewable from a point of view (e.g.,within a sphere, center of a sphere) as a function of progress throughthe progress length of the spherical video content.

A spherical video frame may include a spherical image of the sphericalvideo content at a moment within the progress length of the sphericalvideo content. Visual content of spherical video content may begenerated based on light received within a field of view of a singleimage sensor or within fields of view of multiple image sensors during acapture period. For example, multiple images/videos captured by multiplecameras/image sensors may be combined/stitched together to form thevisual content of the spherical video content. The field of view ofcamera(s)/image sensor(s) may be moved/rotated (e.g., viamovement/rotation of optical element(s), such as lens, of the imagesensor(s)) to capture multiple images/videos from a location, which maybe combined/stitched together to form the visual content of thespherical video content.

For example, multiple images captured by multiple cameras/images sensorsat a moment in time may be combined/stitched together to form aspherical video frame for the moment in time. A spherical video framemay include a full spherical image capture (360-degrees of capture,including opposite poles) or a particular spherical image capture (lessthan 360-degrees of capture). A spherical image (e.g., spherical videoframe) may be comprised of multiple sub-images (sub-frames). Sub-imagesmay be generated by a single image sensor (e.g., at different times asthe field of view of the image sensor is rotated) or by multiple imagesensors (e.g., individual sub-images for a moment in time captured byindividual image sensors and combined/stitched together to form thespherical image).

In some implementations, spherical video content may be stored with a5.2K resolution. Using a 5.2K spherical video content may enable viewingwindows (e.g., directed to a portion of a spherical video frame) for thespherical video content with resolution close to 1080p. In someimplementations, spherical video content may include 12-bit videoframes. Other sizes and qualities of spherical video content arecontemplated.

FIG. 3 illustrates an example spherical visual content 300. Thespherical visual content 300 may include content of a spherical image ora spherical video. The spherical visual content 300 may include visualcontent viewable from a point of view (e.g., center of sphere) as afunction of progress through the progress length of the spherical visualcontent 300. FIG. 3 illustrates example rotational axes for thespherical visual content 300. Rotational axes for the spherical visualcontent 300 may include a yaw axis 310, a pitch axis 320, a roll axis330, and/or other axes. Rotations about one or more of the yaw axis 310,the pitch axis 320, the roll axis 330, and/or other axes may definedirections of view (e.g., viewing directions) for the spherical visualcontent 300.

For example, a 0-degree rotation of the spherical visual content 300around the yaw axis 310 may correspond to a front viewing direction. A90-degree rotation of the spherical visual content 300 around the yawaxis 310 may correspond to a right viewing direction. A 180-degreerotation of the spherical visual content 300 around the yaw axis 310 maycorrespond to a back-viewing direction. A −90-degree rotation of thespherical visual content 300 around the yaw axis 310 may correspond to aleft viewing direction.

A 0-degree rotation of the spherical visual content 300 around the pitchaxis 320 may correspond to a viewing direction that may be level withrespect to horizon. A 45-degree rotation of the spherical visual content300 around the pitch axis 320 may correspond to a viewing direction thatmay be pitched up with respect to horizon by 45-degrees. A 90-degreerotation of the spherical visual content 300 around the pitch axis 320may correspond to a viewing direction that may be pitched up withrespect to horizon by 90-degrees (looking up). A −45-degree rotation ofthe spherical visual content 300 around the pitch axis 320 maycorrespond to a viewing direction that may be pitched down with respectto horizon by 45-degrees. A −90-degree rotation of the spherical visualcontent 300 around the pitch axis 320 may correspond to a viewingdirection that may be pitched down with respect to horizon by 90-degrees(looking down).

A 0-degree rotation of the spherical visual content 300 around the rollaxis 330 may correspond to a viewing direction that may be upright. A90-degree rotation of the spherical visual content 300 around the rollaxis 330 may correspond to a viewing direction that may be rotated tothe right by 90-degrees. A −90-degree rotation of the spherical visualcontent 300 around the roll axis 330 may correspond to a viewingdirection that may be rotated to the left by 90-degrees. Other rotationsand viewing directions are contemplated.

A playback of video content (e.g., the spherical video content) mayinclude presentation of one or more portions of visual content on one ormore displays based on a viewing window and/or other information. Aviewing window may define extents of the visual content viewable on oneor more displays as the function of progress through the progress lengthof the video content. The viewing window may define extents of thevisual content presented on the display(s) as the function of progressthrough the progress length of the video content. For spherical videocontent, the viewing window may define extents of the visual contentviewable from the point of view as the function of progress through theprogress length of the spherical video content.

The viewing window may be characterized by viewing directions, viewingsizes (e.g., viewing zoom, viewing magnification), viewing rotations,and/or other information. A viewing direction may define a direction ofview for video content. A viewing direction may define the angle/visualportion of the video content at which the viewing window may bedirected. A viewing direction may define a direction of view for thevideo content selected by a user, defined by instructions for viewingthe video content, and/or determined based on other information aboutviewing the video content as a function of progress through the progresslength of the video content (e.g., director track specifying viewingdirection to be presented during playback as a function of progressthrough the progress length of the video content). For spherical videocontent, a viewing direction may define a direction of view from thepoint of view from which the visual content may be defined. Viewingdirections for the video content may be characterized by rotationsaround the yaw axis 310, the pitch axis 320, the roll axis 330, and/orother axes. For example, a viewing direction of a 0-degree rotation ofthe video content around a yaw axis (e.g., the yaw axis 310) and a0-degree rotation of the video content around a pitch axis (e.g., thepitch axis 320) may correspond to a front viewing direction (the viewingwindow may be directed to a forward portion of the visual contentcaptured within the spherical video content).

For example, FIG. 4 illustrates example changes in viewing directions400 (e.g., selected by a user for video content, specified by adirector's track) as a function of progress through the progress lengthof the video content. The viewing directions 400 may change as afunction of progress through the progress length of the video content.For example, at 0% progress mark, the viewing directions 400 maycorrespond to a zero-degree yaw angle and a zero-degree pitch angle. At25% progress mark, the viewing directions 400 may correspond to apositive yaw angle and a negative pitch angle. At 50% progress mark, theviewing directions 400 may correspond to a zero-degree yaw angle and azero-degree pitch angle. At 75% progress mark, the viewing directions400 may correspond to a negative yaw angle and a positive pitch angle.At 87.5% progress mark, the viewing directions 400 may correspond to azero-degree yaw angle and a zero-degree pitch angle. The viewingdirections 400 may define a path of movement for the viewing window(e.g., a trajectory followed by the viewing window) as a function ofprogress through the progress length of the video content. Other viewingdirections are contemplated.

A viewing size may define a size of the viewing window. A viewing sizemay define a size (e.g., size, magnification, viewing angle) of viewableextents of visual content within the video content. A viewing size maydefine the dimensions of the viewing window. A viewing size may define asize of viewable extents of visual content within the video contentselected by a user, defined by instructions for viewing the videocontent, and/or determined based on other information about viewing thevideo content as a function of progress through the progress length ofthe video content (e.g., director track specifying viewing size to bepresented as a function of progress through the progress length of thevideo content). In some implementations, a viewing size may definedifferent shapes of viewable extents. For example, a viewing window maybe shaped as a rectangle, a triangle, a circle, and/or other shapes.

A viewing rotation may define a rotation of the viewing window. Aviewing rotation may define one or more rotations of the viewing windowabout one or more axis. In some implementations, a viewing rotation maybe defined by one or more parameters of a viewing direction. Forexample, a viewing rotation may be defined based on rotation about anaxis (e.g., the roll axis 330) corresponding to a viewing direction. Aviewing rotation may define a rotation of the viewing window selected bya user, defined by instructions for viewing the video content, and/ordetermined based on other information about viewing the video content asa function of progress through the progress length of the video content(e.g., director track specifying viewing rotation to be used as afunction of progress through the progress length of the video content).For example, a viewing rotation of a viewing window having a rectangularshape may determine whether the rectangular viewing window is to bepositioned in a portrait orientation (e.g., for a portrait view of thevideo content), in a landscape orientation (e.g., for a landscape viewof the video content), and/or other orientation with respect to thevisual content of the video content.

FIGS. 5A-5B illustrate examples of extents for spherical visual content500. In FIG. 5A, the size of the viewable extent of the spherical visualcontent 500 may correspond to the size of extent A 510. In FIG. 5B, thesize of viewable extent of the spherical visual content 500 maycorrespond to the size of extent B 520. Viewable extent of the sphericalvisual content 500 in FIG. 5A may be smaller than viewable extent of thespherical visual content 500 in FIG. 5B. The viewable extent of thespherical visual content 500 in FIG. 5B may be more tilted with respectto the spherical visual content 500 than viewable extent of thespherical visual content 500 in FIG. 5A. Other viewing sizes and viewingrotations are contemplated.

An image capture device may refer to a device captures visual content.An image capture device may capture visual content in form of images,videos, and/or other forms. For example, an image capture device mayrefer to a camera and/or an image sensor. FIG. 6 illustrates an exampleimage capture device 602. Visual content may be captured by the imagecapture device 602. The image capture device 602 may include a housing612, and the housing 612 may carry (be attached to, support, hold,and/or otherwise carry) one or more of an optical element A 604A, anoptical element B 604B, an image sensor A 606A, an image sensor B 606B,a processor 608, and/or other components. Other configurations of imagecapture devices are contemplated.

One or more components of the image capture device 602 may be the sameas, be similar to, and/or correspond to one or more components of thesystem 10. For example, the optical elements 604, 604B may be the sameas, be similar to, and/or correspond to the optical elements 14A, 14B.The image sensors 606A, 660B may be the same as, be similar to, and/orcorrespond to the image sensors 15A, 15B. The processor 608 may be thesame as, be similar to, and/or correspond to the processor 11.

The optical elements 604A, 604B may include instrument(s), tool(s),and/or medium that acts upon light passing through theinstrument(s)/tool(s)/medium. For example, the optical elements 604A,604B may include one or more of lens, mirror, prism, and/or otheroptical elements. The optical elements 604A, 604B may affect direction,deviation, and/or path of the light passing through the optical elements604A, 604B. While the optical elements 604A, 604B are shown in astaggered configuration, this is merely an example. In someimplementations, the optical elements 604A, 604B

The image sensors 606A, 606B may include sensor(s) that convertsreceived light into output (electrical) signals. The image sensors 606A,606B may generate output signals conveying information that definesvisual content of one or more images and/or one or more video frames ofa video. For example, the image sensors 606A, 606B may include one ormore of a charge-coupled device sensor, an active pixel sensor, acomplementary metal-oxide semiconductor sensor, an N-typemetal-oxide-semiconductor sensor, and/or other image sensors.

The image sensors 606A, 606B may be configured to generate outputsignals conveying visual information (defining visual content of imagesand/or videos) based on light that becomes incident thereon and/or otherinformation. The optical element A 604A may be configured to guide lightwithin a field of view to the image sensor A 606A, and the image sensorA 606A may be configured to generate output signals conveying visualinformation based on light that becomes incident thereon via the opticalelement A 604A. The optical element B 604B may be configured to guidelight within a field of view to the image sensor B 606B, and the imagesensor B 606B may be configured to generate output signals conveyingvisual information based on light that becomes incident thereon via theoptical element B 604B. The fields of view of the optical elements 604A,604B may refer to the extents of the observable world that is seenthrough the optical elements 604A, 604B. The field of views of theoptical elements 604A, 604B may include one or more angles (e.g.,vertical angle, horizontal angle, diagonal angle) at which light isreceived and passed on by the optical elements 604A, 604B. The fields ofview of the optical elements 604A, 604B may be the same or different.

The fields of view of the optical elements 604A, 604B may be greaterthan or equal to 180-degrees. The optical elements 604A, 604B may becarried by the housing 312 such that peripheral portions of the fieldsof view of the optical elements 604A, 604B overlap. The overlap of theperipheral portions of the fields of view of the optical elements 604A,604B may enable capture of spherical visual content (e.g., of imagesand/or videos) based on the visual information conveyed by the outputsignals of the image sensors 606A, 606B. That is, the overlap of theperipheral portions of the fields of view of the optical elements 604A,604B may enable capture of spherical visual content based on visualcontent defined by the visual information of the image sensor A 606A(visual content captured by the image sensor A 606A) and visual contentdefined by the visual information of the image sensor B 606B (visualcontent captured by the image sensor B 606B).

The processor 608 may include one or more processors (logic circuitry)that provide information processing capabilities in the image capturedevice 602. The processor 608 may provide one or more computingfunctions for the image capture device 602. The processor 608 mayoperate/send command signals to one or more components of the imagecapture device 602 to operate the image capture device 602. For example,the processor 608 may facilitate operation of the image capture device602 in capturing image(s) and/or video(s), facilitate operation of theoptical elements 604A, 604B (e.g., change how light is guided by theoptical elements 604A, 604B), and/or facilitate operation of the imagesensors 606A, 606B (e.g., change how the received light is convertedinto information that defines images/videos and/or how the images/videosare post-processed after capture). The processor 608 may include and/orperform one or more functionalities of the processor 11 shown in FIG. 1.

The image capture device 602 may include other sensors, such as one ormore motion sensors. A motion sensor may refer to one or more sensorsthat converts experienced motions into output signals. The outputsignals may include electrical signals. A motion sensor may generateoutput signals conveying information that characterizes motions and/orpositions of the motion sensor and/or device(s) carrying the motionsensor, such as the image capture device 602. The motions/positionscharacterized by a motion sensor may include translationalmotions/positions and/or rotational motions/positions. For example, amotion sensor may refer to a set of motion sensors, which may includeone or more inertial measurement units, one or more accelerometers, oneor more gyroscopes, and/or other motion sensors.

The image capture device 602 may be configured to operate in and switchbetween a spherical capture mode, a non-spherical capture mode, and/orother modes. For instance, the processor 608 may receive input(s) from auser to switch between different modes, detect conditions for whichoperation under one of the modes is desirable, and/or otherwisedetermine that the image capture device 602 should operate in aparticular mode. A mode may refer to a way or a manner in which theimage capture device 602 operates to capture images/videos. A sphericalcapture mode may refer to a way or a manner in which the image capturedevice 602 operates to capture spherical visual content (e.g., sphericalimages/videos). A non-spherical capture mode may refer to a way or amanner in which the image capture device 602 operates to capturenon-spherical visual content (e.g., non-spherical images/videos).

For example, when the image capture device 602 is operating in thespherical capture mode, the image capture device 602 may utilize lightreceived within fields of view of both optical elements 604A, 604B togenerate spherical images/videos. When the image capture device 602 isoperating in the non-spherical capture mode, the image capture device602 may utilize light received within one of the fields of view of theoptical elements 604A, 604B to generate non-spherical images/videos.That is, one of the optical elements 604A, 604B and/or one of the imagesensors 606A, 606B may be turned off/not operating to captureimages/videos. This may allow the image capture device 602 to operate ina lower-power mode (e.g., consuming less processing power, battery) thanwhen operating in the spherical capture mode.

Operation of the image capture device in the non-spherical capture modemay include use of light received within the fields of view of bothoptical elements 604A, 604B. For example, light received within thefield of view of the optical element B 604B may be used to generate (bythe image sensor B 606B) visual content for inclusion in thenon-spherical visual content while the light received within the fieldof view of the optical element 604A may be used to determine whichportion(s) of the visual content generated by the image sensor B 606Bare included in the non-spherical visual content. As another example,light received within the field of view of the optical element B 604Bmay be used to generate (by the image sensor B 606B) visual content forinclusion in the non-spherical visual content while the light receivedwithin the field of view of the optical element 604A may be used tocontrol the optical element B 604B and/or the image sensor B 606B. Forinstance, the visual content generated by the image sensor A 606A (basedon light guided thereto by the optical element A 604A) may be analyzedto determine which portion(s) of the visual content generated by theimage sensor B 606B are included in the non-spherical visual contentand/or how the optical element B 604B and/or the image sensor B 606B areoperated/controlled to generate visual content.

In some implementations, the processing resources (e.g., provided by theprocessor 608) of the image capture device 602 may be directed todifferent functions based on whether the image capture device 602 isoperating in the spherical capture mode or the non-spherical capturemode. For example, the processor 608 may include a single processingunit that reserves a portion of its processing capabilities forimages/videos captured by the image sensor A 606A and another portion ofits processing capabilities for images/videos captured by the imagesensor B 606B. The processor 608 may include multiple processing units(e.g., multiple chips) with different processor units dedicated todifferent image sensors 606A, 606B.

When the image capture device 602 is operating in the spherical capturemode, the respective portions of processing capabilities of theprocessor 608 (e.g., respective portion of processing capabilities,respective chip) may be in operation for spherical capture of visualcontent. When the image capture device 602 is operating in thenon-spherical capture mode, the portion of the processing resource ofthe processor 608 for the image sensor A 606A may be in operation forcapture of reduced visual content (including capture of smaller field ofview than spherical visual content) based on the visual informationconveyed by output signals of the image sensor A 606A. That is,operation in the non-spherical capture mode may cause the image capturedevice 602 to operate as a non-spherical image capture device.

In some implementations, the capture of the reduced visual content bythe image capture device 602 during the non-spherical capture mode mayenable presentation of a stabilized view of the reduced visual content.That is, the reduced visual content may be stabilized for viewing on oneor more displays. The stabilized view of the reduced content may have aviewing angle of at least 130-degrees. That is, the viewing window forthe stabilized reduced visual content may be sized to include at least130-degrees (e.g., 135-degrees) of view. Because the reduced visualcontent is captured based on light received within the optical element A604A having a field of view greater than or equal to 180-degrees (e.g.,195-degrees), the viewing window for the reduced visual content may havegreater freedom to move within the captured content to provide astabilized view.

When the image capture device 602 is operating in the non-sphericalcapture mode, the portion of the processing resource of the processor608 for the image sensor B 606B may not be in operation for capture ofreduced visual content based on the visual information conveyed byoutput signals of the image sensor B 606B. That is, a portion of theprocessing resource of the processor 608 reserved for image sensor B606B may not be used to process images/videos captured by the imagesensor B 606B. Instead, the portion of the processing resource of theprocessor 608 reserved for image sensor B 606B may be in operation forthe capture of reduced visual content based on the visual informationconveyed by output signals of the image sensor A 606A. That is, aportion of the processing resource of the processor 608 reserved forimage sensor B 606B may be used to process images/videos captured by theimage sensor A 606A. For example, such portion of the processingresource of the processor 608 may be used to perform imageclassification, image stabilization, and/or other image processing. Suchportion of the processing resource of the processor 608 may be used toaugment the image processing performed by the other portion of theprocessing resource (e.g., the portion reserved for the image sensor A606A) or to perform image processing separate/different from the imageprocessing performed by the other portion of the processing resource.

In some implementations, the image capture device 602 may be configuredto switch between the spherical capture mode, the non-spherical capturemode, and/or other modes based on the visual information of one or moreof the image sensors 606A, 606B. For example, the visual information ofone or more of the image sensors 606A, 606B may be analyzed by theprocessor 608 to determine whether the image capture device 602 shouldoperate in the spherical capture mode, the non-spherical capture mode,and/or other modes, or to change from one mode to another. For instance,an image generated through the image sensor B 606 may be dark/black(e.g., based on the optical element B 604B being covered) and theprocessor 608 may operate the image capture device 602 in thenon-spherical capture mode to capture reduced visual content using theoptical element A 604A and the image sensor A 606A. Such setting ofoperation mode for the image capture device 602 may enable a user tochange between spherical capture mode and non-spherical capture mode bysimply uncovering/covering one of the optical elements 604A, 604B,rather than by manually setting the operation of the image capturedevice 602 in a particular mode by interacting with interfaces (e.g.,physical/virtual buttons) of the image capture device 602.

Referring back to FIG. 1, the processor 11 may be configured to provideinformation processing capabilities in the system 10. As such, theprocessor 11 may comprise one or more of a digital processor, an analogprocessor, a digital circuit designed to process information, a centralprocessing unit, a graphics processing unit, a microcontroller, ananalog circuit designed to process information, a state machine, and/orother mechanisms for electronically processing information. Theprocessor 11 may be configured to execute one or more machine-readableinstructions 100 to facilitate operation of an image capture device,such as the image capture device 602 shown in FIG. 6. Themachine-readable instructions 100 may include one or more computerprogram components. The machine-readable instructions 100 may includeone or more of a spherical mode component 102, a non-spherical modecomponent 104, a switch component 106, and/or other computer programcomponents.

The spherical mode component 102 may be configured to operate an imagecapture device in a spherical capture mode and/or other modes. Aspherical capture mode may refer to a way or a manner in which the imagecapture device operates to capture spherical visual content (e.g.,spherical images/videos). An operation of the image capture device inthe spherical capture mode may include generation of spherical visualcontent based on the first visual content (defined by the first visualinformation, which may be conveyed by the first output signal generatedby the image sensor A 15A), the second visual content (defined by thesecond visual information, which may be conveyed by the second outputsignal generated by the image sensor B 15B), and/or other visualcontent. When the image capture device is operating in the sphericalcapture mode, the image capture device may utilize light received withinfields of view of both optical elements 14A, 14B to generate sphericalimages/videos.

For example, referring to FIG. 6, the operation of the image capturedevice 602 in the spherical capture mode may include the following: theimage sensor A 606A may generate a first output signal conveying firstvisual information defining first visual content based on light guidedthereto by the optical element A 604A. The image sensor B 606B maygenerate a second output signal conveying second visual informationdefining second visual content based on light guided thereto by theoptical element B 604B. Spherical visual content may be generated basedon the first visual content and the second visual content. The firstvisual content and the second visual content may individually includehemispherical visual content. Hemispherical visual content may includevisual content depicting at least half of a sphere (include 180-degreesor more of visual content capture).

For example, referring to FIG. 7A, the first visual content may includevisual content A 702 and the second visual content may include visualcontent B 704. Operation of the image capture device 602 in thespherical capture mode may include generation of spherical visualcontent 706 based on the visual content A 702, the visual content B 704,and/or other visual content. For instance, the spherical visual content706 may be generated based on a combination of the visual content A 702and the visual content B 704. One or more portions of the visual contentA 702 may be stitched with one or more portions of the visual content B704 to generate the spherical visual content 706. In some embodiments,the generation of the spherical visual content 706 may include encodingof the visual content A 702 stitched with the visual content B 704. Thatis, the spherical visual content 706 may be stored as a single sphere.In some embodiments, the generation of the spherical visual content 706may include separate storage of the visual content A 702 and the visualcontent B 704, with the visual content A 702 and the visual content B704 being stitched as needed to provide a view of the spherical visualcontent 706. That is, the spherical visual content 706 may be stored astwo halves of a sphere.

The non-spherical mode component 104 may be configured to operate theimage capture device in a non-spherical capture mode and/or other modes.A non-spherical capture mode may refer to a way or a manner in which theimage capture device 602 operates to capture non-spherical visualcontent (e.g., non-spherical images/videos, two-dimensionalimages/videos). An operation of the image capture device in thenon-spherical capture mode may include generation of non-sphericalvisual content based on the first visual content (defined by the firstvisual information, which may be conveyed by the first output signalgenerated by the image sensor A 15A) or the second visual content(defined by the second visual information, which may be conveyed by thesecond output signal generated by the image sensor B 15B), and/or othervisual content.

When the image capture device is operating in the non-spherical capturemode, the image capture device utilize light received within one of thefields of view of the optical elements 14A, 14B to generatenon-spherical images/videos. For instance, one of the optical elements14A, 14B and/or one of the image sensors 15A, 15B may be turned offand/or not operating to capture images/videos. This may allow the imagecapture device to operate in a lower-power mode (e.g., consuming lessprocessing power, battery) than when operating in the spherical capturemode.

For example, referring to FIG. 6, the operation of the image capturedevice 602 in the non-spherical capture mode may include the following:the image sensor A 606A may generate an output signal conveying visualinformation defining visual content based on light guided thereto by theoptical element A 604A. Non-spherical visual content may be generatedbased on the visual content. The visual content may includehemispherical visual content, such as the visual content A 702 (shown inFIG. 7A). Alternatively, the image sensor B 606B may generate an outputsignal conveying visual information defining visual content based onlight guided thereto by the optical element B 604B. Non-spherical visualcontent may be generated based on the visual content. The visual contentmay include hemispherical visual content, such as the visual content B704 (shown in FIG. 7B). Thus, visual content captured by different imagesensors 606A, 606B may be used to generate different non-sphericalvisual content. The image sensors 606A, 606B may capture visual contentbased on light received from different directions. For example, theimage sensor 606A may be a back-facing image sensor (selfie-image sensorto capture visual content of the user of the image capture device 602)while the image sensor 606B may be a front-facing image sensor(outward-facing image sensor to capture front of the image capturedevice 602)

The images/videos captured by the image capture device 602 duringoperation in the non-spherical capture mode may not be stitched togetherbut may rather be used for non-spherical image/video generation. Forinstance, the visual content A 702 may be used to generate non-sphericalvisual content A while the visual content A 702 may be used to generatenon-spherical visual content B.

When the image capture device is operating in the non-spherical capturemode, the image capture device may utilize visual content captured byone of the image sensors 15A, 15B to determine which portion(s) of thevisual content captured by the other image sensor are included in thenon-spherical visual content generated by the image capture device. Forexample, referring to FIG. 7B, the visual content A 702 may be used todetermine which portion(s) of the visual content B 704 are included thenon-spherical visual content generated by the image capture device.

For instance, the operation of the image capture device in thenon-spherical capture mode may include inclusion of one or more extents(spatial portions) of the visual content B 704 within the non-sphericalvisual content. The extent(s) of the visual content B 704 includedwithin the non-spherical visual content may be determined based on agaze of a user within the visual content A 702. For example, the visualcontent A 702 may include images/video frames captured by a selfie-imagesensor while the visual content B 704 may include images/video framescaptured by a front-facing image sensor. Which portion(s) of the includeimages/video frames captured by the front-facing image sensor areincluded within the non-spherical visual content may be determined bytracking user's gaze, which may be depicted within images/video framescaptured by the selfie-image sensor.

A user's gaze may refer to how a user of the image capture device islooking at something, such as whether the user is looking at aparticular object and/or an event steadily (e.g., the user is looking atan object or an event for a threshold amount of time). A user's gaze maybe characterized by one or more directions in which the user is looking.The direction(s) in which the user is looking may correspond to viewingdirection(s) for visual content (e.g., spherical visual content). Auser's gaze may be determined based on analysis of one or more bodyparts of the user. For example, a user's gaze may be determined based onvisual analysis of the location/orientation of the user's eyes, theuser's iris, the user's nose, the user's head, and/or other parts of theuser's head. The determination of user's gaze may be facilitated throughuse of other information about the user, such as information on theuser's body pose (e.g., the user extending a hand/finger in a particulardirection), information on the user's vocal cues about locations ofinterest (e.g., the user stating whether the location of interest is infront, to the left, to the right, behind, above, or below the user),and/or other information.

The images/video frames captured by the selfie-image sensor may be usedto track the user's gaze and automatically determine a punch-out of theimages/video frames captured by the front-facing image sensor. Forexample, the user's gaze tracked using the selfie-image sensor may beused to determine where interesting things are happening within theimages/video frames captured by the front-facing image sensor. Object ofinterest and/or event of interest detection in images/video framescaptured by the front-facing image sensor may be performed based onanalysis of images/video frames captured by the selfie-image sensor.Such generation of non-spherical visual content may enable generation ofnon-spherical visual content that includes depiction of objects (e.g.,material things, persons) and/or events (e.g., action, activity, motion)on which the user of the image capture device is focused (e.g., lookingat).

FIG. 8A illustrates examples of gaze detection and capture extent forspherical visual content. The spherical visual content in FIG. 8A may becomposed of visual content A 802 (e.g., back hemispherical visualcontent) and visual content B 804 (e.g., front hemispherical visualcontent). The visual content A 802 may be analyzed to detect the gaze ofa user. For example, one or more of the user's eyes, the user's iris,the user's nose, the user's head, other parts of the user's head, and/orother parts of the user's body may be detected to determine a spatialportion of the visual content A 802 that may be used for gaze detection.For instance, based on the location of depiction of the user's eyes, theuser's iris, the user's nose, the user's head, and/or other parts of theuser's head within the visual content A 802, a gaze detection 812 may beperformed within an extent of the visual content A 802.

The gaze detection 812 may determine the gaze of the user, such as inwhich direction(s) the user is looking. For example, the gaze detection812 may determine that the user is looking to the right of the front ofthe image capture device. Based on the gaze of the user, a punch-outincluding a capture extent 824 of the visual content B 804 may be usedto generate non-spherical visual content. Based on the gaze of the user,thing(s) (e.g., object(s), event(s)) of interest may be determined to bedepicted within the capture extent 824 of the visual content B 804. Thecapture extent 824 may be characterized by a viewing direction (e.g.,centered in the capture extent 834), a viewing size, and/or viewingrotation.

In some implementations, face classification may be used to determinethe gaze of the user and/or the location of thing(s) of interestdepicted within the visual content. Facial classification of visualcontent may include detection of face(s) depicted within the visualcontent and classification of the detected face(s). Classification of aface may include categorization and/or labeling of the face based on oneor more features of the face. Features of the face may includedistinctive attributes and/or aspects of the face. For example, featuresof the face may include one or more of face parts (e.g., eyes, eyelids,eyebrows, nose, cheeks, mouth, teeth), one or more positions and/ormovement of face parts (e.g., open mouth, mouth expression, closed eyes,gaze direction), one or more emotions and/or one or more expressionsdisplayed on the face, and/or other features of the face. Theclassification of a face may depend on the length for which one or morefeatures are maintained (e.g., length of expression display on theface).

Based on the classification of one or more faces, location of depictionof interest (depiction of things of interest) within the visual contentmay be identified within the spherical video. For example, one or morefaces within spherical visual content (e.g., shown in FIG. 8A) may beanalyzed to classify the face(s). Based on the types of expressions onface(s), the direction(s) of gaze, and/or the time(s) that theexpressions remain on face(s), the face(s) may be categories and/orlabeled into a particular classification. Based on the classification, adepiction of interest within the visual content may be identified asbeing located within the capture extent 824.

The non-spherical visual content may be generated to include aparticular edit of the spherical visual content. That is, portions ofthe spherical visual content may be edited for inclusion in thenon-spherical visual content. The non-spherical visual content mayinclude the depiction of interest (visual content within the captureextent 824). Based on the classification of face(s), the non-sphericalvisual content may also include one or more of the detected face(s). Forexample, one type of classification of face(s) may result in thedepiction of face(s) (visual content within which face(s) are depicted)being included within the non-spherical visual content while anothertype of classification of face(s) may result in the depiction of face(s)not being included within the non-spherical visual content.

For instance, a spectacular move by a player during a sporting event maycause particular reactions (e.g., cheering faces) on the faces of acrowd watching the sporting event. Based on the classification of thefaces, non-spherical visual content (two-dimensional visual content of avideo edit) of the sporting event may be generated to include apunch-out of the move by the player (with the duration and spatialextent of the punch-out determined based on the classification of thefaces), followed by a punch-out of the crowd reaction. Based on theclassification of the faces, non-spherical visual content(two-dimensional visual content of a video edit) of the sporting eventmay be generated to simultaneous include punch-outs of the player andthe crowd reaction. Based on the classification of the faces,non-spherical visual content (two-dimensional visual content of a videoedit) of the sporting event may be generated to include a punch-outs ofthe player, and no punch-out of the crowd. Thus, spherical visualcontent may be automatically edited into non-spherical visual contentbased on detection of faces, with different types of videoedits/non-spherical visual content being generated based on theclassification of detected faces.

The location of depiction of interest within visual content may bedetermined directly based on the gaze of the user and/or indirectlybased on the gaze of the user. For example, based on the gaze of theuser (as determined from the gaze detection 812), the depiction ofinterest may be determined to be located within the capture extent 824.Alternatively, based on the gaze of the user (as determined from thegaze detection 812), visual analysis and/or computer vision techniquesmay be performed one or more extents of the visual content B 804. Forexample, visual analysis/computer vision may be performed within anextent of the visual content B 804 that is larger than and includes thecapture extent 824. The visual analysis/computer vision may be performedto identify depiction(s) of interest. For example, based on the visualanalysis/computer vision, the depiction of interest may be identified asbeing located within the capture extent 824. In other words, the gaze ofthe user may be used to identify which portions of the visual contentmay be analyzed for depiction of interest identification. Suchlocalization of the visual analysis/computer vision within the visualcontent may provide for resource savings (e.g., computing power, time,memory, power consumption) compared to performing visualanalysis/computer vision over the entirety of the visual content (e.g.,entire spherical visual content).

When the image capture device is operating in the non-spherical capturemode, the image capture device may utilize visual content captured byone of the image sensors 15A, 15B to control the other the image sensorand/or the optical element corresponding to the other optical imagesensor. For example, referring to FIG. 8B, the visual content A 802 maybe used to control the image sensor and/or the optical element used tocapture the visual content B 804. For instance, a gaze detection 822 maybe performed within the visual content A 802 to determine that adepiction of interest is located within a capture extent 824 of thevisual content B 804. Based on the gaze of the user (as determined fromthe gaze detection 822), the focus of the image capture device may beset and/or changed for capture of the visual content B 804. For example,the focus of the image capture device may be directed to a focus point816. While the focus point 816 is located in the center of the captureextent 824 in FIG. 8B, this is merely an example and not mean to belimiting. A focus point may be located in non-center point of a captureextent.

In some implementations, the operation of the image capture device inthe non-spherical capture mode may include combination of two or moreportions of multiple visual content within the non-spherical visualcontent. For example, referring to FIG. 8C, non-spherical visual contentmay be generated to include one or more portions (spatial extent, timedurations) of the visual content A 802 and one or more portions of thevisual content B 804. For instance, non-spherical visual content may begenerated to include a portion of the visual content A 802 within acapture extent A 832 and a portion of the visual content B 804 within acapture extent B 834. The directions, sizes, and/or the rotations of thecapture extents 832, 834 may be determined based on gaze detection,visual analysis, computer vision technique, audio analysis, and/or otherinformation.

The portions of multiple visual content included in the non-sphericalvisual content may be time-synchronized based on operation of a singleimage capture device. That is, because the multiple visual content arecaptured by multiple image sensors of a single image capture device(e.g., the image sensors 606A, 606B of the image capture device 602),the multiple visual content may already be time-synchronized and theremay not be any need to synchronize timing of the multiple visualcontent. For example, the image capture device 602 may capture one setof images/video frames using the image sensor A 606A and another set ofimages/video frames using the image sensors B 606B. The two sets ofimages/video frames may be captured during the same capture duration andmay be associated with same capture times. Thus, the two sets ofimages/video frames are already time-synchronized, and the two sets ofimages/video frames may provide two feeds of visual content that aretime-synchronized with each other.

In some implementations, portions of multiple visual content may bespatially arranged within the non-spherical visual content forsimultaneous presentation. For example, images/video frames and/orportions of images/video frames captured by different image sensors maybe combined into a single view, such as in a picture-in-picturepresentation, a split-screen presentation, and/or other presentation.

FIG. 9A illustrates an example spatial arrangement of visual contentpresentation. The presentation shown in FIG. 9A may include apresentation of visual content A 902 within a presentation of visualcontent B 904. That is, the presentation shown in FIG. 9A may include apicture-in-picture presentation, where the inner picture are includesthe visual content A 902 and the outer picture area includes the visualcontent B 904.

FIG. 9B illustrates an example spatial arrangement of visual contentpresentation. The presentation shown in FIG. 9B may include apresentation of visual content A 912 next to a presentation of visualcontent B 914. That is, the presentation shown in FIG. 9B may include asplit-screen presentation, where the left picture area includes thevisual content A 912 and the right picture area includes the visualcontent B 914.

The arrangement of visual content within a presentation may bedetermined based on types of depictions within the visual content. Forexample, referring to FIG. 9A, the outer picture area may include thevisual content B 904 based on the visual content B 904 includingdepiction of things in front of an image capture device while the innerpicture area may include the visual content A 902 based on the visualcontent A 902 including depiction of things behind the image capturedevice, such as a user of the image capture device.

The non-spherical visual content may be stored for later playback,shared with one or more users and/or computing devices, and/orlive-streamed to one or more users and/or computer devices. For example,front and back images/video frames captured by a single spherical imagecapture device (e.g., 360-degree camera) may be combined intonon-spherical visual content. For instance, the front-facing imagesensor and the front-facing optical element may be used to capture acenter-weighted, stabilized, standard front image (e.g., 1080P, 1440P,other resolution). The front image may include visual content capturedthrough the entire field of view of the front-facing optical element orthe visual content capture through a portion of the field of view of thefront-facing optical element. The front image may be displayed on adisplay screen of the image capture device, which may allow the user toframe the front-shot as if a non-spherical image capture device werebeing used.

Multiple visual content captured by the spherical image capture devicemay be processed individually. Additionally, multiple visual contentcaptured by the spherical image capture device may be provided forpresentation and/or generation of non-spherical visual content withoutstitching, thereby requiring consumption of less resource than when thespherical image capture device is used to capture and generate sphericalvisual content.

The back-facing image sensor and the back-facing optical element may beused to capture a back image including a depiction of the user. The backimage may include visual content captured through the entire field ofview of the back-facing optical element or the visual content capturethrough a portion of the field of view of the back-facing opticalelement. For example, the back image may include the visual contentcaptured through a portion of the field of view that includes the user.Face and/or person tracking may be used to determine how the user isdepicted within the back image (user framing). User framing may bemaintained within the back image regardless of how the image capturedevice is being held by the user. That is, the portion of the visualcontent included in the back image may be determined to include aparticular shot of the user. Such a generation of back image may providean automatic selfie image of the user. Thus, a user would not have tomanually position themselves for a selfie-shot and may concentrate oncomposing the shot for the front image. The back image may be placed ina picture-in-picture arrangement with the front picture, such as shownin FIG. 9A.

The arrangement of visual content within a presentation may be changed.For example, referring to FIG. 9A, the visual content included withinthe outer picture area and the inner picture area may be changed. Forexample, a user may interact with one or more options presented on auser interface to manually set which visual content (which image/videofeed) is presented in the different positions. The placement of thevisual content may depend on inclusion of depiction of interest withinthe visual content. For example, based on a depiction of interestoccurring within the visual content A 902, the presentation of inner andouter picture areas may be changed so that depiction of interest ispresented within the outer (larger) picture area. The placement of thevisual content may depend on audio associated with the visual content.For example, based on audio recorded with the visual content indicatingthat a depiction of interest is occurring within the visual content A902, presentation of inner and outer picture areas may be changed sothat depiction of interest is presented within the outer (larger)picture area.

As another example, the extent of area taken up by the inner picturearea and/or the outer picture may change. For example, the size and/orthe shape of the inner picture area may change based on the depiction ofinterest within the visual content A 902. For instance, based on thevisual content A 902 including a depiction of interest (e.g., the innerpicture area showing a depiction of interest or about to show adepiction of interest), the size of the inner picture area may beincreased. Based on a depiction of interest within the visual content A902 spanning a large lateral area (compared to the vertical area of thedepiction of interest), the shape of the inner picture area may bechanged to accommodate the large lateral area. For instance, the shapeof the inner picture area may change from a square to a rectangle, suchas the left side of the square moving to the left to extend the width ofthe rectangle.

In some implementations, different visual content may capture content ofdifferent types. For example, visual content captured by front-facingimage sensor may include primary content while visual content capturedback-facing image sensor may include supplemental content. Primarycontent may refer to content of greater importance than supplementalcontent. Primary content may include content that is directly relevantto the subject of visual content capture. Supplemental content may referto content of less importance than primary content. Supplemental contentmay include content that is indirectly relevant to the subject of visualcontent capture. Supplemental content may enhance the primary content.

For example, the image capture device may be used to capture visualcontent of an activity. Primary content may include depiction of theactivity, and supplemental content may include depiction of thingsrelated to and/or associated with the activity. For instance, theactivity may include a sporting activity, and the primary content mayinclude depiction of players engaged in the sporting activity while thesupplemental content may include depiction of crowds at the sportingactivity.

The primary content and the supplemental content may be used to generatenon-spherical visual content. The non-spherical visual content may be avideo edit of the visual content. A video edit may refer to anarrangement and/or manipulation of one or more portions of visualcontent for playback. A video edit may define which portions of thevisual content are included for playback and the order in which theportions are to be presented on playback. For example, referring to theexample of the sporting activity, a video edit generated from theprimary content and the supplemental content may include videofootage/images of the players of the sporting activity (primary content)and video footage/images of the spectators of the activity (supplementalcontent). The supplemental content may be superimposed into the primarycontent, such as shown in FIG. 9A, and/or presented with the primarycontent, such as shown in FIG. 9B.

In some implementations, one of the primary content and/or thesupplemental content may be used to identify when other content shouldbe combined within the non-spherical visual content. The primary contentmay be used to identify supplemental content for integration within thenon-spherical visual content (e.g., video edit). For example, adepiction of interest within the primary content may be used to identifythe corresponding supplemental content. For instance, a depiction ofinterest within the primary content may include a spectacular move by aplayer during a sporting event. The moment and/or the location in whichthe spectacular move occur within the primary content may be used toidentify the moment and/or the location of the corresponding reactionfrom the crowd within the supplemental content for inclusion in thenon-spherical visual content.

The supplemental content may be used to identify primary content forinclusion in the non-spherical visual content (e.g., video edit). Forexample, the presence/occurring of depiction of interest (a moment ofinterest) within the primary content may be identified based on analysisof the supplemental content. For instance, an interesting event may bedetermined to happen within the activity captured in the primary contentbased on reactions of crowd captured in the supplemental content. Thesupplemental content may be used to edit and/or trim the primary contentinto the non-spherical visual content. For example, a “best” scenesvideo edit may be generated from a five-minute footage of primarycontent by editing/trimming down footage of the primary content to athirty-second video edit. The footage of primary content may beedited/trimmed based on reactions (e.g., cheering, positive, coolreactions) found in the supplemental content.

In some implementations, portions of multiple visual content may betemporally arranged within the non-spherical visual content forsequential presentation. For example, images/video frames and/orportions of images/video frames captured by different image sensors maybe arranged in a sequence so that they are presented in a particularorder.

FIGS. 9C and 9D illustrate example timing arrangements of visualcontent. In the timing arrangement shown in FIG. 9C, visual content A922 may be presented first in the non-spherical visual content. Visualcontent B 924 may be presented after the presentation of the visualcontent A 922. In the timing arrangement shown in FIG. 9D, a portion ofvisual content A 922 may be presented first in the non-spherical visualcontent. Visual content B 924 may be presented after the presentation ofthe portion of the visual content A 922. Another portion of the visualcontent A 922 may be presented after the presentation of the visualcontent B 934.

In some implementations, portions of multiple visual content may betemporally arranged for sequential presentation based on audioassociated with one or both of the visual content. For example,referring to FIG. 9D, based on audio recorded with the visual content A933 indicating that a depiction of interest is occurring within thevisual content B 934, presentation of the visual content A 932 maychange to the presentation of the visual content B 934. When thedepiction of interest is over, the presentation may change back to thevisual content A 932.

In some implementations, portions of the multiple visual content may betemporally arranged for sequential presentation based on identificationof one or more depictions of interest within the portions of themultiple visual content. For example, referring to FIG. 9C, a depictionof interest may be identified within the visual content A 922 andanother depiction of interest may be identified within the visualcontent B 924. The depiction of interest within the visual content A 922may occur earlier than the depiction of interest within the visualcontent B 924. Based on the sequence of occurrence of depictions ofinterest, the portions of multiple visual content may be temporallyarranged for sequential presentation as such in FIG. 9C, such that thevisual content A 922 is presented first to include the presentation ofthe depiction of interest within the visual content A 922, and thevisual content B 924 is presented second to include the presentation ofthe depiction of interest within the visual content B 924.

In some implementations, a sequential presentation of portions ofmultiple visual content may include a panning transition from adepiction of interest within one visual content to a depiction ofinterest within another visual content. FIG. 9E illustrates an examplepanning between depictions of interest. As shown in FIG. 9E, a portionof visual content A 942 may include a depiction of interest A 952, and aportion of visual content B 944 may include a depiction of interest B964. A sequence presentation of the portions of the visual content A 942and the visual content B 944 may include a presentation of the depictionof interest A 952 followed by a presentation of the depiction ofinterest B 954. The presentation of the depiction of interest A 952 maychange to the presentation of the depiction of interest B 954 by using apanning 956.

The panning 956 may include a viewing window (punch-out) of the visualcontent moving from being centered on the depiction of interest A 952 tobeing centered on the depiction of interest B 954. While the viewingwindow (punch-out) is entirely within the visual content A 942 or thevisual content B 944, the presentation of the corresponding portion ofthe visual content 942, 944 do not require stitching of the visualcontent 942, 944. When the viewing window (punch-out) includes both thevisual content A 942 and the visual content B 944 (the viewingwindow/punch-out is moving across the boundary of the visual content A942 and the visual content B 944), the visual content 942, 944 (orportions of the visual content 942, 944 within the viewingwindow/punch-out) may need to be stitched. Limiting stitching of thevisual content 942, 944 to those instances in which portions of bothvisual content 942, 944 are being presented may provide for resourcesavings (e.g., computing power, time, memory, power consumption)compared to stitching visual content 942, 944 for the entire duration ofthe presentation.

In some implementations, stitching of the visual content 942, 944 may befurther limited based on the speed of the panning 956 and/or the typesof depictions within the visual content 942, 944. For example, if thepanning 956 is performed at high speed, then lower quality stitching maybe performed for the visual content 942, 944 than if the panning 956 isperformed at low speed. As another example, if the depictions of thevisual content 942, 944 at the boundary across which the viewingwindow/punch-out moves includes non-distinctive features (e.g., sky,ocean, sand), lower quality stitching may be performed while if thedepictions of the visual content 942, 944 at the boundary across whichthe viewing window/punch-out moves includes distinctive features (e.g.,face, persons), higher quality stitching may be performed.

Non-spherical visual content may be generated as encoded visual contentand/or instructions for rendering visual content. For example,non-spherical visual content including multiple portions of visualcontent captured by different image sensors may be encoded for streamingand/or playback. As another example, non-spherical visual contentincluding multiple portions of visual content captured by differentimage sensors may be stored as instructions that identify which portionsof the visual content should be presented and the order in which theportions are to be presented. For example, different visual contentcaptured by different image sensors may be stored separately, and theinstructions may be used to render the presentation of non-sphericalvisual content from the separately stored visual content. Suchgeneration of non-spherical visual content may enable later alterationof the non-spherical visual content from the separately stored visualcontent.

The switch component 106 may be configured to switch the operation ofthe image capture device between the spherical capture mode and thenon-spherical capture mode. Thus, the image capture device may be usedto create both spherical visual content and non-spherical visualcontent, depending on the mode in which it is operating. The switchcomponent 106 may switch the operation of the image capture devicebetween different modes manually and/or automatically. For instance, theswitch component 106 may manually switch the operation of the imagecapture device between different modes based on reception of input(s)from a user to switch between different modes.

The switch component 106 may automatically switch the operation of theimage capture device between different modes based on detection ofconditions for which operation under one of the modes is desirable. Forexample, the switch component 106 may manually switch the operation ofthe image capture device between different modes based on visualinformation of one or more of the image sensors 15A, 15B. For instance,the visual information of one or more of the image sensors 15A, 15B maybe analyzed to determine whether the image capture device should operatein the spherical capture mode, the non-spherical capture mode, and/orother modes, or to change from one mode to another. For example, animage generated through the image sensor B 15B may be dark/black (e.g.,based on the optical element B 14B being covered) and the switchcomponent 106 may switch the operation of the image capture device tonon-spherical capture mode and/or operate the image capture device inthe non-spherical capture mode to capture reduced visual content usingthe optical element A 14A and the image sensor A 15A.

Such setting of operation mode for the image capture device may enable auser to change between spherical capture mode and non-spherical capturemode by simply uncovering/covering one of the optical elements 14A, 14B,rather than by manually setting the operation of the image capturedevice in a particular mode by interacting with interfaces (e.g.,physical/virtual buttons) of the image capture device.

As another example, visual information of the image sensors 15A, 15B maybe compared to determine whether similar or different depiction ofthings are being captured within the visual content. For instance, basedon the image sensors 15A, 15B being used to capture similar visualcontent (e.g., depiction of an activity, depiction of a scenery), theswitch component 106 may switch the operation of the image capturedevice to spherical capture mode and/or operate the image capture devicein the spherical capture mode to capture spherical visual content usingthe optical elements 14A, 14B and the image sensors 15A, 15B. Based onthe image sensors 15A, 15B being used to capture different visualcontent (e.g., depiction of a particular person (user of the imagecapture device) in one visual content, depiction of other thing in theother visual content), the switch component 106 may switch the operationof the image capture device to non-spherical capture mode and/or operatethe image capture device in the non-spherical capture mode.

Such setting of operation mode for the image capture device may enable auser to change between spherical capture mode and non-spherical capturemode based on the context in which the image capture device is used. Forexample, the image capture device being held/carried by a user mayresult in the image capture device operating in the non-sphericalcapture mode to provide a selfie and third-person views of the capturedcontent, while the image capture device being placed in the middle of anactivity may result in the image capture device operating in thespherical capture mode to provide a spherical vide of the activity.Other conditions for switching the operation of the image capture devicebetween the spherical capture mode and the non-spherical capture modeare contemplated.

Implementations of the disclosure may be made in hardware, firmware,software, or any suitable combination thereof. Aspects of the disclosuremay be implemented as instructions stored on a machine-readable medium,which may be read and executed by one or more processors. Amachine-readable medium may include any mechanism for storing ortransmitting information in a form readable by a machine (e.g., acomputing device). For example, a tangible computer-readable storagemedium may include read-only memory, random access memory, magnetic diskstorage media, optical storage media, flash memory devices, and others,and a machine-readable transmission media may include forms ofpropagated signals, such as carrier waves, infrared signals, digitalsignals, and others. Firmware, software, routines, or instructions maybe described herein in terms of specific exemplary aspects andimplementations of the disclosure, and performing certain actions.

In some implementations, some or all of the functionalities attributedherein to the system 10 may be provided by external resources notincluded in the system 10. External resources may include hosts/sourcesof information, computing, and/or processing and/or other providers ofinformation, computing, and/or processing outside of the system 10.

Although the processor 11 and the electronic storage 13 are shown to beconnected to the interface 12 in FIG. 1, any communication medium may beused to facilitate interaction between any components of the system 10.One or more components of the system 10 may communicate with each otherthrough hard-wired communication, wireless communication, or both. Forexample, one or more components of the system 10 may communicate witheach other through a network. For example, the processor 11 maywirelessly communicate with the electronic storage 13. By way ofnon-limiting example, wireless communication may include one or more ofradio communication, Bluetooth communication, Wi-Fi communication,cellular communication, infrared communication, or other wirelesscommunication. Other types of communications are contemplated by thepresent disclosure.

Although the processor 11 is shown in FIG. 1 as a single entity, this isfor illustrative purposes only. In some implementations, the processor11 may comprise a plurality of processing units. These processing unitsmay be physically located within the same device, or the processor 11may represent processing functionality of a plurality of devicesoperating in coordination. The processor 11 may be configured to executeone or more components by software; hardware; firmware; some combinationof software, hardware, and/or firmware; and/or other mechanisms forconfiguring processing capabilities on the processor 11.

It should be appreciated that although computer components areillustrated in FIG. 1 as being co-located within a single processingunit, one or more of computer program components may be located remotelyfrom the other computer program components. While computer programcomponents are described as performing or being configured to performoperations, computer program components may comprise instructions whichmay program processor 11 and/or system 10 to perform the operation.

While computer program components are described herein as beingimplemented via processor 11 through machine-readable instructions 100,this is merely for ease of reference and is not meant to be limiting. Insome implementations, one or more functions of computer programcomponents described herein may be implemented via hardware (e.g.,dedicated chip, field-programmable gate array) rather than software. Oneor more functions of computer program components described herein may besoftware-implemented, hardware-implemented, or software andhardware-implemented

The description of the functionality provided by the different computerprogram components described herein is for illustrative purposes, and isnot intended to be limiting, as any of computer program components mayprovide more or less functionality than is described. For example, oneor more of computer program components may be eliminated, and some orall of its functionality may be provided by other computer programcomponents. As another example, processor 11 may be configured toexecute one or more additional computer program components that mayperform some or all of the functionality attributed to one or more ofcomputer program components described herein.

The electronic storage media of the electronic storage 13 may beprovided integrally (i.e., substantially non-removable) with one or morecomponents of the system 10 and/or removable storage that is connectableto one or more components of the system 10 via, for example, a port(e.g., a USB port, a Firewire port, etc.) or a drive (e.g., a diskdrive, etc.). The electronic storage 13 may include one or more ofoptically readable storage media (e.g., optical disks, etc.),magnetically readable storage media (e.g., magnetic tape, magnetic harddrive, floppy drive, etc.), electrical charge-based storage media (e.g.,EPROM, EEPROM, RAM, etc.), solid-state storage media (e.g., flash drive,etc.), and/or other electronically readable storage media. Theelectronic storage 13 may be a separate component within the system 10,or the electronic storage 13 may be provided integrally with one or moreother components of the system 10 (e.g., the processor 11). Although theelectronic storage 13 is shown in FIG. 1 as a single entity, this is forillustrative purposes only. In some implementations, the electronicstorage 13 may comprise a plurality of storage units. These storageunits may be physically located within the same device, or theelectronic storage 13 may represent storage functionality of a pluralityof devices operating in coordination.

FIG. 2 illustrates method 200 for operating an image capture device. Theoperations of method 200 presented below are intended to beillustrative. In some implementations, method 200 may be accomplishedwith one or more additional operations not described, and/or without oneor more of the operations discussed. In some implementations, two ormore of the operations may occur substantially simultaneously.

In some implementations, method 200 may be implemented in one or moreprocessing devices (e.g., a digital processor, an analog processor, adigital circuit designed to process information, a central processingunit, a graphics processing unit, a microcontroller, an analog circuitdesigned to process information, a state machine, and/or othermechanisms for electronically processing information). The one or moreprocessing devices may include one or more devices executing some or allof the operation of method 200 in response to instructions storedelectronically on one or more electronic storage mediums. The one ormore processing devices may include one or more devices configuredthrough hardware, firmware, and/or software to be specifically designedfor execution of one or more of the operations of method 200.

Referring to FIG. 2 and method 200, an image capture device may includea first image sensor, a second image sensor, and/or other components.The first image sensor may be configured to generate a first outputsignal conveying first visual information based on light that becomesincident thereon. The first visual information may define first visualcontent. The second image sensor may be configured to generate a secondoutput signal conveying second visual information based on light thatbecomes incident thereon. The second visual information may definesecond visual content.

At operation 201, the image capture device may be operated in aspherical capture mode. Operation of the image capture device in thespherical capture mode may include generation of spherical visualcontent based on the first visual content and the second visual content.In some implementation, operation 201 may be performed by a processorcomponent the same as or similar to the spherical mode component 102(Shown in FIG. 1 and described herein).

At operation 202, operation of the image capture device in may beswitched from the spherical capture mode to the non-spherical capturemode. In some implementations, operation 202 may be performed by aprocessor component the same as or similar to the switch component 106(Shown in FIG. 1 and described herein).

At operation 203, the image capture device may be operated in thenon-spherical capture mode. Operation of the image capture device in thenon-spherical capture mode may include generation of non-sphericalvisual content based on the first visual content. In someimplementations, operation 203 may be performed by a processor componentthe same as or similar to the non-spherical mode component 104 (Shown inFIG. 1 and described herein).

Although the system(s) and/or method(s) of this disclosure have beendescribed in detail for the purpose of illustration based on what iscurrently considered to be the most practical and preferredimplementations, it is to be understood that such detail is solely forthat purpose and that the disclosure is not limited to the disclosedimplementations, but, on the contrary, is intended to covermodifications and equivalent arrangements that are within the spirit andscope of the appended claims. For example, it is to be understood thatthe present disclosure contemplates that, to the extent possible, one ormore features of any implementation can be combined with one or morefeatures of any other implementation.

What is claimed is:
 1. An image capture device comprising: a housing; afirst image sensor carried by the housing and configured to generate afirst output signal conveying first visual information based on lightthat becomes incident thereon, the first visual information definingfirst visual content; a second image sensor carried by the housing andconfigured to generate a second output signal conveying second visualinformation based on light that becomes incident thereon, the secondvisual information defining second visual content; a front-facingoptical element configured to guide light within a first field of viewto the first image sensor, the first field of view being greater than180 degrees, the front-facing optical element being carried by thehousing; a back-facing optical element configured to guide light withina second field of view to the second image sensor, the second field ofview including a user of the image capture device when the image capturedevice is held by the user such that the second visual content includesdepiction of the user, the second field of view being greater than 180degrees, the back-facing optical element being carried by the housingsuch that a peripheral portion of the first field of view and aperipheral portion of the second field of view overlap, the overlap ofthe peripheral portion of the first field of view and the peripheralportion of the second field of view enabling spherical capture of visualcontent based on the first visual content and the second visual content;and one or more physical processors carried by the housing, the one ormore physical processors configured by machine-readable instructions to:operate the image capture device in a spherical capture mode, wherein anoperation of the image capture device in the spherical capture modeincludes generation of spherical visual content based on stitching ofthe first visual content and the second visual content; operate theimage capture device in a non-spherical capture mode, wherein anoperation of the image capture device in the non-spherical capture modeincludes generation of non-spherical visual content that includes anextent of the first visual content, wherein the extent of the firstvisual content included within the non-spherical visual content isdetermined based on a gaze of the user depicted within the second visualcontent; and switch between the operation of the image capture device inthe spherical capture mode and the operation of the image capture devicein the non-spherical capture mode.
 2. The system of claim 1, wherein theoperation of the image capture device in the non-spherical capture modefurther includes combination of the extent of the first visual contentand an extent of the second visual content within the non-sphericalvisual content, the extent of the first visual content and the extent ofthe second visual content being time-synchronized based on operation ofthe first image sensor and the second image sensor by the image capturedevice, the extent of the second visual content providing a selfie-viewof the user.
 3. The system of claim 2, wherein the extent of the firstvisual content and the extent of the second visual content are spatiallyarranged within the non-spherical visual content for simultaneouspresentation.
 4. The system of claim 2, wherein the extent of the firstvisual content and the extent of the second visual content aretemporally arranged within the non-spherical visual content forsequential presentation.
 5. The system of claim 4, wherein the extent ofthe first visual content and the extent of the second visual content aretemporally arranged for sequential presentation based on audioassociated with the second visual content indicating a depicting ofinterest occurring within the first visual content.
 6. A method foroperating an image capture device, the image capture device including afirst image sensor configured to generate a first output signalconveying first visual information based on light that becomes incidentthereon via a front-facing optical element having a first field of view,the first visual information defining first visual content, a secondimage sensor configured to generate a second output signal conveyingsecond visual information based on light that becomes incident thereonvia a back-facing optical element having a second field of view, thesecond visual information defining second visual content, the secondfield of view including a user of the image capture device when theimage capture device is held by the user such that the second visualcontent includes depiction of the user, the method comprising: operatingthe image capture device in a spherical capture mode, wherein anoperation of the image capture device in the spherical capture modeincludes generation of spherical visual content based on stitching ofthe first visual content and the second visual content; operating theimage capture device in a non-spherical capture mode, wherein anoperation of the image capture device in the non-spherical capture modeincludes generation of non-spherical visual content that includes anextent of the first visual content, wherein the extent of the firstvisual content included within the non-spherical visual content isdetermined based on a gaze of the user depicted within the second visualcontent; and switching between the operation of the image capture devicein the spherical capture mode and the operation of the image capturedevice in the non-spherical capture mode.
 7. The method of claim 6,wherein the operation of the image capture device in the non-sphericalcapture mode further includes combination of the extent of the firstvisual content and an extent of the second visual content within thenon-spherical visual content, the extent of the first visual content andthe extent of the second visual content being time-synchronized based onoperation of the first image sensor and the second image sensor by theimage capture device, the extent of the second visual content providinga selfie-view of the user.
 8. The method of claim 7, wherein the extentof the first visual content and the extent of the second visual contentare spatially arranged within the non-spherical visual content forsimultaneous presentation.
 9. The method of claim 7, wherein the extentof the first visual content and the extent of the second visual contentare temporally arranged within the non-spherical visual content forsequential presentation.
 10. The method of claim 9, wherein the extentof the first visual content and the extent of the second visual contentare temporally arranged for sequential presentation based on audioassociated with the second visual content indicating a depicting ofinterest occurring within the first visual content.
 11. The system ofclaim 1, wherein the operation of the image capture device in thenon-spherical capture mode further includes processing resource reservedfor the second image sensor being utilized to process the first visualcontent captured by the first image sensor.
 12. The system of claim 1,wherein the operation of the image capture device automatically switchesfrom the spherical capture mode to the non-spherical capture mode basedon the back-facing optical element being covered, covering of theback-facing optical element determined based on the second visualcontent being dark.
 13. The system of claim 1, wherein the operation ofthe image capture device automatically switches between the sphericalcapture mode and the non-spherical capture mode based on similaritybetween the first visual content and the second visual content.
 14. Thesystem of claim 13, wherein the operation of the image capture deviceautomatically switches from the non-spherical capture mode to thespherical capture mode based on the first visual content and the secondvisual content being similar.
 15. The system of claim 13, wherein theoperation of the image capture device automatically switches from thespherical capture mode to the non-spherical capture mode based on thefirst visual content and the second visual content being dissimilar. 16.The method of claim 6, wherein the operation of the image capture devicein the non-spherical capture mode further includes processing resourcereserved for the second image sensor being utilized to process the firstvisual content captured by the first image sensor.
 17. The method ofclaim 6, wherein the operation of the image capture device automaticallyswitches from the spherical capture mode to the non-spherical capturemode based on the back-facing optical element being covered, covering ofthe back-facing optical element determined based on the second visualcontent being dark.
 18. The method of claim 6, wherein the operation ofthe image capture device automatically switches between the sphericalcapture mode and the non-spherical capture mode based on similaritybetween the first visual content and the second visual content.
 19. Themethod of claim 18, wherein the operation of the image capture deviceautomatically switches from the non-spherical capture mode to thespherical capture mode based on the first visual content and the secondvisual content being similar.
 20. The method of claim 18, wherein theoperation of the image capture device automatically switches from thespherical capture mode to the non-spherical capture mode based on thefirst visual content and the second visual content being dissimilar.